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I.  Introduction 


The  Poisson  channel  is  an  additive  noise  channel  with  output  Yt  =  Xt  4-  Nt  where 
N —{Nt  }o<i<r  is  the  channel  noise  and  X  =  {Xt}o<t<r  is  the  transmitted  signal  into  which  is 
encoded  the  message  0={0t  }o<t<r-  X  is  sometimes  viewed  as  the  output  of  the  channel  encoder. 
All  processes  in  the  channel  model  are  defined  on  a  common  probability  space  (Cl,F  ,P).  We  write 
F'  for  the  natural  history  of  6,  FN  for  the  natural  history  of  N ,  etc. 

Poisson-type  point  processes  [6]  are  a  class  of  simple  point  processes  in  which  the  compensa¬ 
tor  takes  the  form 

t 

fa,  db{s) 
o 


where  a,  is  some  nonnegative  predictable  process  (the  intensity  of  the  point  process)  and  b  (t )  is  a 
deterministic  nonnegative  right-continuous  and  nondecreasing  function.  In  the  Poisson  channel 
(often  called  the  Poisson-type  point  process  channel)  both  X  and  N  are  Poisson-type  point 
processes.  We  shall  only  consider  Poisson-type  point  processes  with  6  (t )  =  l .  The  noise  N  in  the 
Poisson  channel  is  given  to  be  directed  by  the  .//-predictable  noise  intensity  At  where  H  is  a  his¬ 
tory  such  that  FtN  C  Ht  for  all  t  e  [0,  T).  X  is  given  to  be  directed  by  the  JF,v  Fr -predictable 
encoding  intensity  \t  -  Then  the  channel  output  Yt  =  Xt  +  Nt  is  also  a  Poisson-type  point  pro¬ 
cess  directed  by  the  intensity  rj,  =  \t  +  A(  where  is  predictable  with  respect  to  the  global  his¬ 
tory  F,'r  H v  Fr .  In  the  Poisson  channel  model,  the  message  6  is  encoded  into  the  channel 
encoder  output  A',  indirectly  via  the  encoding  intensity  Xt  =  Xi(0>^)-  Ftlv  ,Ff ^predictability  of 
Xt  allows  for  nonanticipative  message  encoding  and  causal,  noiseless,  instantaneous  feedback. 


The  augmented  noise  history  H  is  introduced  into  the  channel  model  so  that  the  whole  class 
of  Poisson-type  point  processes  with  integrating  function  b  (t )  —  t  can  be  considered  as  models 
for  the  channel  noise.  In  particular,  renewal  processes,  self-exciting  processes  and  doubly  stochas¬ 
tic  Poisson  processes  fall  into  this  class  [7],  For  instance,  N  is  a  doubly  stochastic  Poisson  process 
for  Ht  =  irrAv  FtN  (suitably  completed). 


In  the  absence  of  constraints  on  the  encoder  output,  the  information  capacity  of  the  Poisson 
channel  is  infinite.  A  peak  constraint  0  <  Xt  <  c  is  generally  imposed  on  the  encoder  intensity  [l], 
[2],  [3].  An  average  constraint 

T 

E[fXtdt\  <  k0T 
o 


has  also  been  considered  [l],  [3].  These  and  other  encoder  constraints  are  considered  in  Section  V 
where  some  justification  is  given  for  emphasis  on  the  peak  constraint  in  this  and  earlier  papers  [l], 

[2],  [3]. 


To  define  channel  capacity  let  fi$,  fty ,  and  (i$r  be  the  marginal  and  joint  measures  induced 
by  the  message  and  output  processes,  6  and  Y ,  on  the  spaces  Se,  Sy  ,  and  SjXSy  where  S«  and 
Sy  are  the  spaces  of  trajectories  of  $  and  Y  over  the  interval  [0,T].  Write  the  induced  product 
measure  as  iisxy  ■  Then,  the  average  mutual  information  in  0  and  Y  over  the  interval  [0,T]  is  [5] 


IT[e,Y\  =  E 


In' 


d  ntY 
d  Hi 


XY 


n  For 
\JfcI 


jed 


provided  h»y  «  y  I  otherwise  IT  [0,Y ]  =  oo.  Using  an  obvious  notation,  the  conditional  mutual 
information  [5]  in  0  and  Y  given  a  trajectory  of  the  noise  intensity  is  _ 

d  fitY  |  a  , 


IT[e,Y  I  Aj  =  £ 


In 


d  fitxr  i  a 
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The  channel  information  capacity  is 

<P  =  S"P  s£p  \lT[8,Y\ 

where  the  suprema  are  taken  over  all  admissible  message  processes,  6,  and  all  admissible  channel 
encodings,  X.  The  class  of  admissible  message  processes  is  generally  taken  to  be  all  jointly 
measurable  real-valued  processes  with  index  set  [0,7’].  A  further  comment  on  admissibility  of 
messages  and  encodings  for  the  Poisson  channel  can  be  found  in  [2]. 

We  shall  assume  that  the  message  and  noise  are  independent.  Specifically,  it  is  assumed  that 
the  histories  H  and  F1  are  independent.  This  assumption  has  the  essential  consequence  that  the 
average  mutual  information  in  the  channel  is  upper-bounded  by  the  expected  conditional  informa¬ 
tion  given  the  path  of  the  noise  intensity.  Independence  of  the  histories  H  and  F *  also  implies 
that,  in  the  no-feedback  case,  Xt  and  A t  are  independent  processes. 

The  capacity  of  the  Poisson  channel  has  been  previously  treated  for  cases  in  which  the  noise 
intensity  is  a  real  constant  A<  =  X  [2],  [3]  and  for  cases  in  which  the  noise  intensity  is  a  deter¬ 
ministic  function  of  time  At  =  X(t )  [l].  Applications  of  and  further  references  to  the  Poisson 
channel  model  are  given  in  [4].  We  refer  to  [l]  for  motivation  for  treating  the  channel  noise  inten¬ 
sity  as  a  stochastic  process. 

It  will  be  convenient  to  have  notations  for  certain  conditional  expectations.  For  any  ran¬ 
dom  variable  z  define 

i  —  E[z  \Ftl], 

!  =  E[z  |  Ffi  'Ft)  , 

5  =E[z  |  F/.-F/.}. 
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II.  Average  Mutual  Information 
for  Random  Noise  Intensity 


The  average  mutual  information  in  the  Poisson  channel  with  random  noise  intensity  is  given 
by  Liptser  and  Shiryayev  [6]  to  be 

/  T  [9,  Y ]  =  E  [/(te  (xt  +  A  t )  ~  u>  (x«  +  A,  ))dt\  (l) 


where  w(x)  —  x  Inx .  Note  that  \t  —  Xt  ■  For  the  case  of  nonrandom  noise  intensity, 
A,  =  A  t  =  A, .  In  this  case  the  usually  intractable  calculation  of  A,  and  A  ,  is  avoided  and  (l)  is 
greatly  simplified.  In  the  case  of  stochastic  noise  intensity,  the  presence  of  At  and  A  ,  limits  the 
direct  usefulness  of  the  expression  for  IT  [fl,yj.  The  conditional  information  given  the  path  of  the 
noise  intensity 


IT[e,Y  \A]  =  E 


f(w(xt  +A ,)-w(xt  +A ())dt  |  Ft 


sidesteps  this  difficulty.  We  have  the  following  inequality: 


Proposition  1:  Suppose  the  message  and  noise  intensity  processes  0  and  A  are  independent. 

Then 


It[0,Y]  <  E 


T 

f(w(Xt  +A  t)-w(x,  +A  ,))dt 


o 


Proof:  This  is  a  special  case  of  a  general  inequality  which  follows  from  Kolmogorov’s  for¬ 
mula  [8]: 

1[X,Z\  +  E\I\X,Y  I  Z]}  =  I[X,{Y,Z)) 

for  random  elements  X,  Y,  Z  provided  Pxyz  «  Px'XVyz  ■  Independence  of  X  and  Z  implies 
I[X ,Z]  =0.  Also,  I[X ,(Y ,Z)}  >  /[X,y]  so,  in  the  context  of  the  Poisson  channel, 

IT[0,Y]<E[IT[0,Y  |  A]]. 


Proposition  2: 


fr 


It[0,Y\  >  E 


f(u>(xt  +  A,)-U)(x,  +  A,  ))dt 
.0 


Proof:  Using  Jensen’s  inequality,  we  have  E[w(xt  +  A, )]  >E[w(xt  +  A< )].  Use  this  ine¬ 
quality  in  (1)  and  the  result  follows. 

Let  $  be  the  class  of  marginally  stationary  noise  intensities  At  such  that  |  A  t  -  A,  |  — ►  0 
a  s.  as  t  — ►  oo.  Define 

=  {A(  e  4>:  E  \w  (A, )]  <  oo}  , 

$£  =  {A|€  4>:  A(  is  bounded}  . 

Let  L  >  0  be  a  r.v.  such  that  L  is  bounded  or  E[L  InL  ]  is  finite.  Then  A *  =  L  belongs  to 
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or  .  More  generally,  let  p(<)  be  a  bounded  nonnegative  periodic  function  with  period  t0  and 
suppose  IT  is  uniformly  distributed  over  [0,/0]  and  independent  of  L  .  Then  A,  =  Lp  (t  -II)  belongs 
to  or  . 


Proposition  8:  Suppose  A,  e  <1*5  .  Then,  for  the  Poisson  channel 

'T 


lim  ±-1t[61Y\=  lim  ±E 

T  — »oo  1  T— *oo  I 


f{w(Xt  +A,)-u)(x«  +  A,  ))dt 
o 


Proof-.  Use  the  upper  and  lower  bounds  for  IT[0,Y)  given  in  Propositions  1  and  2  and  the 
result  follows. 

According  to  Proposition  3, 

lim  ±IT[OtY\  =  lim  ±=E[It[9,Y  |  A]] 

T  — *oo  i  T  -»oo  1 

for  noise  intensities  At  €  u  . 
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III.  Channel  Capacity 


In  this  section  we  give  capacity  results  for  the  Poisson  channel  with  and  without  feedback. 
For  clarity,  we  content  ourselves  with  treating  marginally  stationary  noise  intensities;  that  is, 
noise  intensities  having  a  common  marginal  distribution  F(z  )  =  P  {At  <x  }  for  all  t  e[0,T].  How¬ 
ever,  the  results  and  their  proofs  given  below  extend  in  a  natural  way  to  nonstationary  noise 
intensities.  Also,  we  only  impose  a  peak  constraint  on  the  encoder  intensity.  Using  the  method  of 
Davis  [3],  the  extension  to  encoder  intensities  both  peak-  and  average-constrained  is  obvious.  We 
also  note  that  the  results  given  here  for  a  constant  peak  constraint  on  the  encoder  intensity  can 
be  adjusted  to  apply  to  a  time-varying  peak  constraint  by  simple  function  approximation  of  the 
peak  constraint  as  in  [l]. 


Th  eorem  Let  the  noise  process  Nt  in  a  Poisson  channel  have  marginally  stationary  noise 
intensity  At  with  marginal  distribution  F(t)  —  P  { At  <x  }  and  suppose  A,  is  independent  of  the 
message  process  9t  .  Suppose  the  encoder  output  intensity  Xi  is  peak-constrained,  0  <  Xt  <  c  and 
require  to  be  F1''  FY  -adapted  so  that  causal  feedback  is  possible.  Then 

C  <  Bj  =E[C{A,)) 


where 


Proof:  By  Proposition  1 


IT[e,Y)  <  E 


E  [J(4>(xt  At )  -  4>(xt  At  ))dt  |  A,  ] 


From  [l],  for  nonrandom  noise  intensity  X(t ),  we  have 

T 

/Mx«  .MO)  -  0(x<  .MO))* 

0 


so,  for  marginally  stationary  A, , 


-~iT[e,Y}<E 


I 


<fC(\(t))dl 

o 


—  E[C  (A, )]  . 


Theorem  5:  Consider  the  Poisson  channel  presented  in  Theorem  4  but  now  restrict  the 
encoding  intensity  to  be  .F#-adapted  so  that  no  feedback  is  allowed.  Then 


where 


with 


®<S./  =  i““i  E[h(p  ,  A,  )] 
7^-? 


h  (p  ,  A, )  =  p  <t>{c  At )  -  4>{pc  At ) 


<f>(x  ,y  )  =  (z  +  y  )ln(z  +  y )  -  y  lny  . 
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Proof-.  Xi  is  F*-adapted  so  x«  and  A<  are  independent.  Thur  E  [x,  |  At  ]  =  FT  [x*  ] -  Then,  by 
Jensen’s  inequality  we  have  E  [to  (x(  +  A, )]  >  E[w  (E  [xt  ]  +  A, )].  So,  by  Proposition  1, 

T 

IT{9,Y}<J{E  [4>(Xt  A  )]  -  E  [< <t>(E  [x, ] »At )] )  dt  . 
o 

Let  Ao  be  A(  for  t  —  0.  The  independence  of  At  and  x«  together  with  the  marginal  stationarity 
of  A,  gives  E  Mxi  ,A, )]  =  E  [0(x<  A))]-  Also  E  \<HE  [x*  ],A, )]  =  E  \<j>{E  [x,  ]A>)]-  Then 

T 

IT[e,Y)<J{E  Mx,  A)}  -  E  \<HE  [x*  ],Ao)] )  dt  . 


yIT[9,Y]  <  t  f[o,*r|  [E[<f>{xt  >Ao))  -  E  [<j>{E  [x»  ],Ao)j )  • 

In  the  supand  above,  t  enters  only  through  Xt  ■  Let  fx(A  )=P»  Xt-1(A  )•  The  support  of  fj  is  con¬ 
tained  in  [0,c  ].  Define 

e 

Q^9)  =  fg{x)»(dx)  . 
o 

Then  Q  „(i )  ==  E  [x«  ]  where  »  is  the  identity  function.  Also  Q ^{E  [<£0])  =  E  \<!>{xt  ,Ao)]  using  the 
independence  of  Xt  and  A-  We  can  now  write 


<c<sJ&  „(«))]] 


where  M  is  the  set  of  all  probability  measures  on  ([0,c  ],J5  [[0,c  ]]).  For  any  there  is  a 

£e[0,c]  such  that  Q ^(i)  ==  k  so 


^  SUP 


- SB  e»(£  w)-ow*)] 


Q  fE  is  linear  so 


2 

The  function  d>(fx  )  -  —  <f>0(e  )  is  maximum  at  x  =0  and  z  =  c  (for  any  value  of  Ao-)  Thus 

E[QMo~~i)\ 

is  maximized  for  Q  ^(i  )=Jfc  by  the  probability  measure 

e({o»  -  ,  /«({« })  -  4 


in  which  case  E[Q  ^ 0 - -«)].  So 


=  olpli  [pE  Me  ,Aq)]  -  E  [4>{pc  A,)]  ] 
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=  o <HiE[h  (p  » At )]  . 

For  0  <  p  <  1,  H(p  )  =  E  [A  (p  ,  A, )]  has  the  first  and  second  derivatives 

*2-  =  E\<t>{cX))-c  -  cE  [ln(pe  +  A, )] 


=  cE  [ln(A'l  +  A, )}  -  cE  [ln(pe  +  At )] 


where  Kt  =  k  (A, )  with 


£»  -  -.‘E  — L- 

dp2  pc  +At 


H(p  )  is  concave  and  so  has  a  unique  maximum  over  p  €  [0,1].  To  see  that  H(p  )  has  its  maximum 
in  the  subinterval  p€  [l/e  , l/2]  we  just  note  that  according  to  (2),  pm„  maximizing  H(p)  must 
satisfy 


E  \nP7~'-  =0 
Kt+  At 


and  the  range  of  A:  is  [c  ft  ,c  /2].  Thus 


o< *P<iE  [A  (p  ,  A, )]  =  ±™±E  [A  (p  ,  A, )]  . 


The  proof  is  complete. 


Of  course,  Bnj  <  Bj  since 


Bn,  -A “^[A(p,A,)] 


Bf  =E[C(At))  =  E  ^A(p,A()]  . 


Theorem  6:  Let  the  noise  intensity  A,  in  a  Poisson  channel  belong  to  or  $b  ■  Suppose 
the  encoder  intensity  is  peak-constrained  and  no  feedback  is  allowed.  Then  C  — »  Bnj  as  T  —*  oo. 

Proof:  There  exists  a  sequence  of  message  processes  {^m^}  [2]  such  that  [l] 

T 

lim  IT\6[m\Y  | A]  =  Jh(PM)dt 

m  —♦oo  o 

where  p  €  [0,1]  is  a  parameter  of  the  process  By  Proposition  3 

r 

lim  lim  ±IT  [tm\Y]  =  lim  ±JE  [A  (p  ,At  )}dt 

m  —*oo  T  -+oo  1  T  — *oo  J  q 
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A(  is  marginally  stationary  so 

lira  lira  ±IT[<tm\Y}  =  E[h(p,At)}. 

m  — »oo  T  —»oo  1 

This  is  true  for  all  p€  [0,l]  so 

lim  C  >  pl'ifii#  [A  (p  ,  At )]  . 

T  — *oo 

This  together  with  Theorem  5  proves  the  result. 

Efficient  communication  through  the  Poisson  channel  requires  that  the  encoder  be  "tuned” 
to  the  channel;  i.e.,  that  the  encoder  be  adjusted  in  accordance  with  the  amount  of  noise  present. 
This  requires  knowledge  of  the  channel  noise  intensity.  In  cases  where  the  noise  intensity  is  ran¬ 
dom  the  state  of  the  noise  intensity  process  is  unknown  to  the  sender.  An  encoding  scheme  incor¬ 
porating  feedback  is  then  needed  whereby  the  sender  makes  a  feedback-based  estimate  of  the 

noise  intensity  and  then  adjusts  the  encoding  process  to  agree  with  that  estimate.  Without  feed¬ 
back  the  capacity  cannot  exceed  B%j  <  B /  .  The  next  theorem  states  that  C  —*  Bf  as  T  — *  oo  if 
causal  feedback  is  allowed. 

Theorem  7:  For  the  Poisson  channel  with  marginally  stationary  noise  intensity  A,  belonging 
to  <I>m  or  Qg ,  peak-constrained  encoder  intensity,  and  causal  feedback,  C  — *  Bj  as  T  — ►  oo. 

Proof:  As  in  the  proof  of  Theorem  6  we  have 

T 

lim  lira  ^=IT{0(m),Y\  =  lim  -^fE  [A  (p  ,A,  )]rft  . 

m  -*ooT -*oo  1  T  —*oo  1  Q 

Choose  p  —  k(k  t)/c  and  check  that  C  (x )  —  h  (k  (x  )/e  ,x ).  For  Ae  e  U 

lim  E  [A  (k  (A  f  )/e  A, )]  =  lim  E  [<7(A, )]  =  E  [C(A, )]  . 

t  — *oo  t  —♦00 


So 


lim  C  > 

T  -*oo 


lim  lim  -|L/7'[<rf,",,y]  =£[<7(^)1. 

T  -*oom  — »oo  1 


This  completes  the  proof. 


IV.  Improvement  Afforded  by  Feedback 


For  Poisson  channels  with  nonrandom  noise  intensity  Bj  =  Bnj  consistent  with  the  fact 
that  causal  feedback  does  not  increase  channel  capacity  in  this  case  [l].  For  stochastic  noise  inten¬ 
sity,  however,  Bj  ^  Bn/  in  general.  Thus  for  Poisson  channels  with  random  noise  intensity  At 
the  fractional  difference 

B,  ~B«, 

Bf 

is,  in  the  sense  of  Theorems  4,  5,  6,  and  7,  a  measure  of  the  improvement  afforded  by  the  use  of 
causal  feedback.  Let  At  be  marginally  stationary  and  let  L  be  a  random  variable  with  distribu¬ 
tion  F(z)  =  P  {A|  <z  }.  Define  D  to  be  the  set  of  all  such  L  ,  i.e.,  D  is  the  set  of  all  nonnegative 
random  variables.  We  shall  show  that 

~~-L  =  0.0163  . 

This  means  that,  while  causal  feedback  offers  some  improvement  in  the  case  of  stochastic  noise 
intensity,  the  improvement  may  never  be  very  large.  In  particular,  in  the  cases  where  Bj  and 
Bn/  are  the  channel  capacities,  the  improvement  afforded  by  feedback  never  exceeds  1.63%  of 
that  achievable  without  feedback.  An  analytic  expression  is  obtamed  for  the  above  supremum  by 
way  of  the  following  series  of  lemmas.  W.l.o.g.  we  take  e  =1. 

Lemma  8:  Let  Db  be  the  set  of  all  bounded  nonnegative  random  variables.  Then 

sup  B1  ~B*I  =  sup  B!  ~B'l 
ieb  Bf  i€fc,  Bf 


Proof:  For  any  nonnegative  r.v.  L  and  positive  real  number  p  let  Lp  —  L  l{i</?}  +  >p}- 

Then  Lf^Db.  Write  Bj  (L )  and  B%t  (L )  for  Bj  and  BmJ  evaluated,  respectively,  for  a  margi¬ 
nally  stationary  noise  intensity  whose  marginal  distribution  is  that  of  the  r.v.  L  .  We  have 


B,  [L)-  Bn,  ( L )  Bj  (L  f)  -  Bn,  (L0) 

Bj  (L )  "  Bj(Lff) 

Bn,  (L)  B%,  (L )  B„(L)  Bn/  (L„) 
Bj  (L)  Bj  (Lf)  +  Bj  (Lfi)  ~  Bj(Lfi) 


Bn,  (L) 

-  Bj  (L  )B,  (Lfi) 


Bj(L)-B,(Lj,)  \  + 


1  B%,  { L)-Bn ,  (Lfi)  | 
B,(Lfi) 


Now  C(  )  is  a  decreasing  function  so 

E[C{L)\  <  E[C(Lfi)\  <  E[C{L))+  C  (0)P  {L  >  0}  . 

Then,  as  0  — *  oo, 

\  Bf  (L)  -  Bf  (L/f)  |  =  \E[C[L))-E[C{L,)}\ 

<  C(/3)P{L>/3) 
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and,  for  |  x  |  — >oo, 


so,  for  X— »oo, 


( P  >x)  =  P  Ml  +  1/X)  -  M1  +  P  /X) 


ln(l  +  l/» )  =  7  -  7T  +  0  (*‘2) 

z  2a: 


x2^(fA)  =  -^£2  +  £>(1). 


Thus,  for  all  X  large  enough,  A  (p  ,X)  is  decreasing  in  X.  Then,  for  0  large  enough, 
E  [A  (p  ,L)\  <  E  [A  (p  ,L  ,)]  <  E  (A  (p  ,L  )]  +  A  (p  ,0)P  {L  >  0) 


so  that,  as  0  — ♦  oo, 


|  (L  )  -  BaJ  (Lt)  |  =  |  ““E  [A  (p  ,L  )]  -  m™E  [A  (p  ,L,)]  | 


<mpax|£[A(p,L))-£[A(p,L^)]  |  . 
<m“A(p  ,/?}/>  {L>/?} 


=  C(0P  {L  >  0} 


This  completes  the  proof. 

Lemma  9:  Let  D2  be  the  family  of  all  random  variables  L  of  the  form 


for  0  >  0  and  e  e  (0,1).  Then 


{0  w.p.  e 
/?  w.p.  1-e 


sup  El  ~Bnt  =  sup  B>  zEll 

l*Dk  B  l*D2  B, 


Proof:  Suppose  L€  Z>t  and  let  0  —  esssupn  L  •  There  is  a  unique  L2*  D2  with 


{0  w.p.  e 
0  w.p.  1 


{0  w.p.  1-e 

for  some  e  €  (0,1)  such  that  E\C(L)]=  E\C(L<$.  Moreover  Bnf  (L2)  <  Bnf  (L)  so 
B,  (L  )  -  Bnf  (L  )  <  B,  (L2)  -  B%1  (L2).  Thus 

Bj  -  P,/  <  Bj  -  Bnf 

Bf  L~  Bt  ** 

Noting  th..t  D2  C  Dh ,  we  get  the  desired  result. 


may  be  written  Bnf  =  E[h(p  mu,L ))  where,  by  (2),  p  ro»*  is  obtained  from 
E  [ln(p  „„  +  [)]=£  [ln(A  (L )  +  L  )] .  For  L  e  D2,  pm„  is  a  function  of  e  and  0.  Its  partial 
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derivatives  with  respect  to  e  and  y?  can  also  be  obtained  from 
E  [ln(p  m„  +  L  )]  =  E  [ln(Jt  (L )  +  L  )]  by  implicit  differentiation.  Then,  by  the  usual  methods  of 
differential  calculus,  the  function 


/  M)  *= 


B,  -fl, 


«/ 


B , 


|l/2 


can  be  shown  to  approach  its  supremum  over  {(e,/9) :  0  <  e  <  1,  P  >  0}  as  e 
Thus, 


»P  El^L.  _  Jim 

L  ^  Bj  Bf 


L< 


0  with  0  =  (e  e)-1. 


where 


L(  = 


0  w.p.  t 
—  w.p.  1-e 

t  £ 


Lemma  10:  For  t  >0  and  L  —  Lt, 


Bf 


8+c2 

8e 


(  +  o  (e). 


Proof:  For  L  =  L(, 

B,  =  <C(0)+(M<7(-±-). 

By  Th.  2  in  [3], 

Using  C(0)  =  1/e  ,  this  leads  to  the  desired  result. 

Leri  -  >a  11:  For  «  >  0  and  L  =  L(, 

B,f  =  «p{(l+epe/2)  +  o(e) 

where  p  t  is  the  value  of  p  in  [l/e  ,1/2]  which  maximizes  H(p  ,L)=  H{p  ,Lt). 

Proof:  Define  K f  =  k  ( L( ).  Then 

Bnf  =  P(E[4>(1,L()]  -  E  [<f>(p  (,L  () ] 

—  P<  +  P (E  [In (K(+L ()]  -  E  {(p  (+L ()ln (p (+L  ,)]  +  E[L {In£ , ] 

=  P(  + p({E{ln{K(+Lt)]-E[ln{p'+Lt)}}-E{LMP'+L<)\  + E[L(\nLc}. 

Recall  from  (2)  that  E^n{K t+L  t)}  =  E  [In(p  Thus 

Bnf  =  p(  -  E  [Ltln(p  r+L()\  +  E  [L,lnLt] 

- '  ■  -  (i-‘)7rli‘(',-+7r> + (i-,)7r1“Tr 

=  Pi  ~  (l-f)~‘bi(l+e  cpt). 
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Using  In(l+x )  =  x-x2/2  +o  ( x 2)  the  desired  result  follows. 


Theorem  12: 


sup  _ 


L*D  B, 


where 


p0  =  limp*  =  0.436815 
€~*0 


satisfies  Inp0  +  ep0  =  e  /2  -  1. 


Proof :  By  the  preceding  lemmas, 


sup  Bf  ~Bu/ 

‘■^sr-^sr  t,= 


+  o(l). 


Thus  all  that  remains  to  be  shown  is  that  lnp 0+  ep0—  e  /2- 1. 
E  [ln(/f  <+.£,<)]  =  E  [ln(pt+Lf)]  from  (4)  we  get 

(l+-i7XMMl+«  <)  -  1  =  «lnp,  +  (l-e)ln(l+e  tp(). 

The  LHS  of  this  equation  can  be  expressed 

LHS  =  (f-l)c  +  o(c) 

while  for  the  RHS,  using  p  t  =  p0  4-  o  (1),  we  get 

RMS  =  elnpo  +  e  ep0  +  o  (c). 

The  proof  is  complete. 


Using 
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V.  Encoder  Output  Constraints 


The  usual  type  of  constraint  imposed  on  the  encoder  in  determining  the  capacity  of  the 
Poisson  channel  is  a  peak  constraint  [l],  [2],  [3]: 

0  <  x<  <  c  ,  Oe[0,r]  (3) 


or  an  average  constraint  [l],  [3]: 

T 

E[fX.ds}<k0T  .  (4) 

o 

In  this  paper,  also,  we  have  used  primarily  a  peak  constraint  on  encoder  output  intensity.  This 
emphasis  in  favor  of  a  peak  constraint  is  now  given  some  justification. 

(3)  and  (4)  are  constraints  on  the  intensity  (and  compensator)  of  the  encoder  output.  One 
might  rather  consider  constraints  imposed  directly  upon  the  output  Xt  of  the  encoder.  We  write 
Pe  to  signify  the  class  of  conditionally  Poisson  processes.  For  X  €  Pt  we  have  [4,  p.  407] 

Var[Xt]  =  Var[At}  + E  [At] 

where  A,  is  the  compensator  of  Xt .  Thus  for  X  €  Pe  the  constraint  Var  [Xt  ]  <  P  is  equivalent 
to  the  pair  of  constraints  on  the  encoder  compensator: 

E  [Aj]  <  k0 

Var[AT]  <  V 


with  0 <k0<P  ,  0<  V  <P  ,  and  k0+  V <P  . 

For  clarity  let  us  write  C(c  ,k0,V ,P)  for  the  capacity  given  the  following  constraints  on  the 
er  coder  output: 

0  <  X«  <  e  , 

0  <  E[At]  S  *0 , 

0  <  Var  [A^]  <  V  , 

0  <  Var  [Xj  ]  <  P  . 

To  exclude  a  constraint  the  corresponding  argument  of  C(y, y)  is  set  to  oo;  <C(oo,y,-)  is  the 
channel  capacity  w'ith  no  peak  constraint  on  the  encoder  intensity,  etc.  A  subscript  is  used 
C p  to  indicate  that,  for  the  encoder  output  X ,  we  require  X  €  Pe .  Using  this  notation, 

we  have 

CP'{oo,oo,oo,P)  =  Cpc{°o,k0,V,oo) 

where  R  —  {(x  ,y ):  x  >0,  y  >0,  x  +y  <P  }. 

We  now  show,  for  A,  =  X,  that 

<C(co,oo,oo,P )  =  oo  . 


The  proof  uses  several  lemmas. 


Lemma  IS:  Let  0t ,  0 <t  <T  be  a  random  telegraph  signal  (2);  i.e.,  a  left-continuous  homo¬ 
geneous  Markov  jump  process  with  states  {0,1}  and  infinitesimal  parameter  matrix 

-m  m 

m(l-p)  m  (l-p  ) 

P  P 


1 00  ?0! 
7  io  7  ii 
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Set  P  {0q=1}  ==  P  ■  Then  0(  is  stationary,  mean-square  continuous,  E  [0t  ]  =  p  ,  and 


##(*  .0  =  E{0,  0,  ]  =  p2  +  p  (1-p  )e  9 


Proof:  Using  some  obvious  notation 

E[et]  =  p{9t  =1} 


=  p0i(t)(l-p)  +  Pu(t)p 

=  L>  -pe~‘](l-p)  +  [p  +(1-P)e"^‘]| 


=  P 


Consider  s  <  t .  Then 


R,(t  ,<)  =  P{9t  =1,0.  =1} 

=  P{0t_,  =l,0o=l} 

=  -Pn(*-«  )P 

Lemma  14.  Suppose  0,  is  the  telegraph  signal  described  above  and  let 

< 

At  =  f  e  9,  da  . 

0 

Then  E  \At  ]  —  pet  and 

)  =  ,v,.  +  +  «Vd-f)  [,-r  +e-7' 


m 


m" 


Proof:  We  just  note  that  0,  is  mean-square  continuous  so 


«  < 


RA  («,<)  =  ffc2Rt(v  ,v  )dvdu  . 


0  0 


Lemma  1 5:  For  j4j  and  0,  as  above  and  f  e[0,  T ],  Var  \At  ]  — 1 ►  0  as  m  — »oo. 
Proof: 

Var[At]  =  2-^— p2(l-p  )l  +  [e~*  -  l]  . 


m 


m 


Lemma  16:  C  p(e  ,k0,V,oo)  —  C  p(e  ,Jt0,oo,oo)  for  all  U>0. 

Proof:  Make  explicit  the  dependence  of  0t  on  the  parameter  m  by  writing  0,  =  0/m  \  Then 
with  the  right  choice  of  p  [3], 

lim  ^IT{&m\Y}  =  CP  (c  ,*0,oo, 00)  . 

m  -*oo  l  * 


-  15  - 


For  any  V>0,  there  is  an  m0  such  that  Var  \At  ]  <  V  for  all  m  >m0-  Thus 
Cpc{e  ,k0tV,oo)  =  Cpt{c  ,k 0, 00,00). 

Theorem  17:  Let  A*  =  X  be  a  real  constant.  Then  C(oo,oo,oo,P)  =  00. 

Proof:  Choose  (k0,  V)  €  R  .  Then 

C(oo,oo,oo,P )  >  C pc (00, 00, 00, P) 

>  Cpc(oo,k0,V,oo) 

>  lim  C p  (e  ,k0,V  ,00) 

c  — »oo  c 

=  lim  C p  (c  ,k 0,00,00) 

c  —*oo  c 

=  lim  C(c  ,k0,oo,oo) 

C  — *CO 

=  lim  k0  ^(c  ’^)  -  <t>(k0,\) 

c  — *00  C 

=  lim  *0  [ln(£(X)+X)  +  l]  -  <f>(k0,\) 

c  -*oo 

>  lim  k o  [ln(c  /e  +  X)  +  1]  -  0(*OA) 

C  — *00 

=  OO  . 


The  above  approach  shows  as  well  that  C(oo,k0,  V  ,P )  —  00  for  all  Jt0  >  0,  V  >  0,  P  >  0. 

To  further  indicate  the  importance  of  the  peak  constraint,  recall  the  following  capacity 
result  from  [l]:  let  the  noise  intensity  be  nonrandom  and  suppose  0  <  Xt  <  e  (f )  where  c  (t )  is 
chosen  freely  subject  only  to 

T 

4 fc(t)dt  <p 
1 0 

for  some  given  P  >  0.  Then  C  =  P  /e  .  If  this  encoder  constraint  is  reformulated  to  eliminate  the 
peak  constraint;  i.e.  the  similar,  but  weaker,  constraint 

T 

4/x,  dt  <P 

1  0 

is  imposed,  then  the  capacity  is  infinite. 

Thus,  it  appears  that  a  peak  constraint  on  the  encoder  intensity  or  its  equivalent  for  the 
encoder  compensator  is  needed  -  at  least  for  the  group  of  constraints  considered  here  -  to  give  a 
well-posed  capacity  problem. 
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VI.  Summary 


Significant  differences  exist  between  Poisson  channels  with  nonrandom  noise  intensity  and 
those  with  stochastic  noise  intensity.  In  the  nonrandom  case  it  is  known  [l],  [2]  that  causal  feed¬ 
back  does  not  increase  channel  capacity.  By  contrast,  we  have  shown  that  causal  feedback  does 
increase  capacity  (though  it  seems  by  only  a  small  amount)  when  the  noise  intensity  is  random. 
Also,  expressions  are  available  [l]  for  the  channel  capacity  in  the  nonrandom  case  for  T  <  oo 
while,  for  stochastic  noise  intensity  and  T  <  60,  only  least  upper  bounds  have  so  far  been 
obtained.  These  differences  arise  out  of  the  simplification  of  the  expression  for  average  mutual 
information  that  is  possible  in  the  case  of  nonrandom  noise  intensity.  In  the  case  of  nonrandom 
noise  intensity  A(  =  \{t ),  the  expression  for  the  channel  information 


1t{6,Y ]  =E 


T 

/WxtA  t  )-<KxtA  ))dt 


0 


(5) 


simplifies  to 


It\0,Y)  =  E 


'T 

fWxi  ,M< ))  -  <t>{xt  )))<** 


Lo 


For  stochastic  noise  intensity  A(  and  A  <  remain,  making  (5)  relatively  intractable.  Only  for  a  cer¬ 
tain  class  of  noise  intensities  can  much  headway  be  made.  For  A»  e  3>wu  ,  it  was  shown  that 
for  the  Poisson  channel  with  feedback 


lim  G  =  E  [  C  (A, )] 

T  — *00 


while,  without  feedback, 

KmC  =  ( PE  \<j>(e  ,Kt)]-E  [4>(pc  A)])  • 
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